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Abstract 

Background: Fusarium graminearum sensu stricto (s.s.) is an ubiquitous pathogen of cereals. The economic impact 
of Fusarium head blight (FHB) is characterized by crop losses and mycotoxin contamination. Our objective was to 
associate SNP diversity within candidate genes with phenotypic traits. A total of 77 F. graminearum s.s. isolates was 
tested for severity of fungal infection (= aggressiveness) and deoxynivalenol (DON) production in an inoculated 
field experiment at two locations in each of two years. For seven genes known to control fungal growth {Met API, 
Erf2) or DON production {TRI1, TRI5, TRI6 TRI10 and TRIM) single nucleotides polymorphic sites (SNPs) were 
determined and evaluated for the extent of linkage disequilibrium (LD). Associations of SNPs with both phenotypic 
traits were tested using linear mixed models. 

Results: Decay of LD was in most instances fast. Two neighboring SNPs in MetAPi and one SNP in Erf2 were 
significantly {P < 0.05) associated with aggressiveness explaining proportions of genotypic variance (p G ) of 25.6%, 
0.5%, and 13.1%, respectively. One SNP in TRII was significantly associated with DON production (p 6 = 4.4). 

Conclusions: We argue that using the published sequence information of Fusarium graminearum as a template to 
amplify comparative sequence parts of candidate genes is an effective method to detect quantitative trait loci. Our 
findings underline the potential of candidate gene association mapping approaches to identify functional SNPs 
underlying aggressiveness and DON production for F. graminearum s.s populations. 

Keywords: Aggressiveness, Association mapping, DON, Fusarium graminearum, FHB, Linkage disequilibrium, QTL, 
Triticum aestivum, SNP 



Background 

Fusarium head blight (FHB) is a destructive disease to 
cereals including wheat and barley. Fusarium grami- 
nearum (teleomorph Gibberella zeae) is considered to 
be the main causal agent of this disease in addition to 
other species such as F. culmorum [1]. F. graminearum 
complex has been subdivided into several cryptic species 
[2], the main member in Germany is F. graminearum 
sensu stricto (s.s.) [3]. FHB disease leads to prematurely 
bleached spikes in infected plants [1] with considerable 
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yield losses and contamination by mycotoxins such as 
deoxynivalenol (DON) [4]. DON is the most common 
food and feed contaminant in Europe and the maximum 
permissible level in unprocessed wheat used for food is 
1.25 mg kg 1 [5]. 

The quantitative ability of an isolate to cause disease 
on a susceptible host plant in a non-race specific patho- 
system is defined as aggressiveness [6]. Aggressiveness is 
an important factor determining the potential ability of 
an isolate to cause yield losses. Large genetic variation 
of aggressiveness, type of mycotoxin, and DON produc- 
tion was found among isolates sampled in the same 
country or even the same field [7-11]. The molecular 
causes of this tremendous variation in aggressiveness 
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and DON production in F. graminearum are still 
unclear. To date, only one QTL (quantitative trait locus) 
study from a single cross was reported detecting at least 
two QTL for aggressiveness [12]. It was shown before, 
that a large amount of molecular variation was found 
within field populations of F. graminearum s.s. using 
SSR (simple sequence repeat) markers. Comparing 12 
individual field populations each consisting of 30 iso- 
lates, Talas et al. [3] reported that 71% of the molecular 
variation was assigned within populations and 29% 
between populations. A cause for this high diversity 
might be the mating system. F. graminearum is a hap- 
loid, homothallic fungus and propagates asexually by 
conidia and sexually by ascospores allowing both selfing 
and outcrossing [7]. With the availability of the full 
genomic sequence of F. graminearum [13], it is now 
possible to deeper analyze this variation by a candidate 
gene approach. An array of candidate genes, including 
components of transcription, signal transduction, host- 
specific nutrition, host infection/colonization and tri- 
chothecene biosynthesis is available in the version 3.2 of 
the Pathogen-Host Interactions database [14]. The can- 
didate genes we were aiming for are known to control 
DON biosynthesis and/or aggressiveness (TRI1, TRIS, 
TR16, TRI10, TRIM) [15,16] or are expected to have a 
link to aggressiveness but are yet uncharacterized for 
Fusarium spp. (MetAPl, Erf2) [17,18]. Candidate gene 
association mapping is a sensitive tool if the mapping 
resolution is high enough [19]. The resolution of asso- 
ciation mapping depends on the extent of linkage dise- 
quilibrium (LD), i.e. the non-random association of 
alleles present in a species [20]. Low LD was reported in 
F. graminearum estimated by different types of genetic 
makers such as VNTR, AFLP, RFLP [7,11,21], but no 
data on LD within and among genes is available. 
Because LD is species and population specific, it should 
consequently be determined before conducting an asso- 
ciation mapping study. 

The specific objectives of our study were to: (i) inves- 
tigate the nucleotide diversity on gene level, (ii) investi- 
gate the extent of LD between single nucleotide 
polymorphic sites (SNPs) within and among candidate 
genes, (iii) identify SNPs of candidate genes Erf2 and 
MetAPl for testing whether they affect the quantitative 
variation for aggressiveness, and (iv) evaluate associa- 
tions of SNPs in TRI1 and other four genes of the TRIS 
cluster with variation in DON production. 

Results 

Phenotypic analysis 

Phenotypic data were analyzed in detail in a companion 
study [9]. All 77 F. graminearum s.s. isolates produced 
symptoms of FHB disease in all environments. Briefly, 
genotypic proportion of phenotypic variance of the 



isolates was significant {P < 0.01) for both traits: Aggres- 
siveness measured as FHB rating on scale from 0 to 100 
and DON production measured as DON concentration 
in wheat kernels in mg kg" 1 . Isolate x environment 
interaction variance was significant (P < 0.01) only for 
mean FHB rating but not for DON production. Entry- 
mean heritability was moderate for mean FHB rating 
(0.55) and DON production (0.62). 

Histograms based on best linear unbiased estimators 
(BLUEs) followed a normal distribution for both traits 
(Figure 1). The mean of FHB infection among all iso- 
lates was 30.6% ranging from 18.4 to 38.9%. Average 
DON production was 12.6 mg kg" 1 ranging from 3.5 to 
21.6 mg kg" . One isolate (WET24) produced a very low 
concentration of DON (0.3 mg kg" 1 ) and was found to 
be a nivalenol producer by chemotype-specific primers. 

Analysis of population structure and diversity 

Principal coordinate analysis (PCoA) based on modified 
Rogers' distances between all isolates did not show a 
distinct separation of the isolates sampled from different 
locations (Figure 2A). Explained variance gradually 
decreased according to the first ten principal coordi- 
nates (Figure 2B). The violin plot (Figure 2C) had a con- 
tinuous density of distribution over all ten principal 
coordinates without any division within the principal 
coordinates. Genetic similarity ranged from 0.057 to 1.0 
with a mean value of 0.31 (Figure 2D). 

Estimation of nucleotide diversity and linkage 
disequilibrium 

Percentage of polymorphic sites per total sequenced 
region of each gene (without singletons) varied from 
0.9% (5/513) on TRIlOb to 8.8% (65/734) on MetAPl 
indicating a high nucleotide diversity in most tested 
genes (Table 1). LD of SNPs within the gene MetAPl 
decayed rapidly within 200 bp of physical distance, i.e., 
the robust locally fitted regression of r 2 values has a 
trend to decay from r 2 = 0.35 to r 2 < 0.1, whereas LD 
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Figure 1 Histograms for 77 F. graminearum isolates. Best linear 
unbiased estimates (BLUEs) for mean FHB rating (A) and DON 
content (B) calculated across four environments (location x year 
combinations) 
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Figure 2 Population structure and familial reiatedness (A) 

Principal coordinate analysis of the 77 isolates, based on modified 
Rogers' distance estimates whereas location A = Hohenheim, 
location B = Schickelsheim, and location C = Wetze, (B) Explained 
genotypic variance of the first ten principal coordinates, (C) Violin 
plot showing the density distribution of the first ten principa 
coordinates, (D) Histogram of the genetic similarities among the 77 
isolates. 



within the gene Erf2 had r 2 values ranging from 0.8 to 
0.2 and the regression of r 2 trends to decay already after 
150 bp (Figure 3). LD with r 2 values higher than 0.1 
were detected between all allele combinations within the 
tested genes followed by a rapid decay negatively corre- 
lated with the physical distance in base pair (Figure 4). 
In line with the rapid decay, LD between genes located 
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Figure 3 Linkage disequilibrium (LD) decay Robust locally fitted 
regression of linkage disequilibrium over base-pair distance within 
the two candidate genes, MetAPl (25 SNPs) and Erf2 (1 1 SNPs). 



on the same chromosome (i.e., TRI1 and MetAPl) is 
low (r 2 < 0.1). Interestingly, 48%, 19%, and 45% of the 
SNP pairs between the genes TRI10I MetAPl, MetAPl I 
TRI5 and MetAPl I Erf2, respectively, have higher values 
of r 2 than 0.1, although they are located on different 
chromosomes. Low r 2 values (< 0.1) were observed for 
SNPs of gene pairs TRIlOIErfi, and TRIS/Erfi, whereas 
r 2 values of 0.2 were detected between SNP pairs of 
TRIlOb and TRI5, followed by a rapid decay of LD. 

Association analysis for aggressiveness and mycotoxin 

Two adjacent SNPs of the gene MetAPl were signifi- 
cantly associated (P < 0.05 using Bonferroni-Holm cor- 
rection) [22] with mean FHB rating with an explained 
genotypic proportion of variance p G = 25.6 and 0.5, 
respectively (Table 2, Figure 5A). Additionally, one SNP 
in the gene Erf2 was significantly associated with this 
trait showing a p G of 13.1% (Table 2, Figure 5A). A sin- 
gle SNP significantly associated with DON production 
was identified in the TRI1 gene explaining 4.4% of the 



Table 1 Sequenced regions of candidate genes and its related nucleotide polymorphism 


Gene ID a 


Sequenced region relative to 
the ATG 


No. of SNPs 
detected 


No. of 
singletons 


Nucleotide diversity 

(%) b 


No. of SNPs with allele 
frequency > 0.1 


TRI1 

(FGSG_00071) 


311-1065 




54 


2.5 


3 


TRI6 

(FGSGJ6251) 


35-558 


36 


29 


1.3 


0 


TRI5 

(FGSG_03537) 


479-1080 


70 


38 


5.3 


A 


TRI10 a 

(FGSG_03538) 


31-678 


123 


80 


6.6 


0 


TRIW b 
(FGSG_03538) 


760-1273 


34 


29 


0.9 


1 


TRIM 

(FGSG_03543) 


277-976 


43 


23 


2.9 


0 


MetAPl 
(FGSG_01397) 


200-934 


106 


-11 


8.8 


25 


Erf2 

(FGSG_08531) 


1193-1825 


80 


42 


6.0 


11 



a The given gene ID is the entry number of the MIPS F. graminearum genome database (FGDB; Wong et al. 2011) 

b Nucleotide diversity is the frequency of SNPs (without singletons) relative to the total length of the sequenced gene region 
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Figure 4 Pairwaise linkage disequilibrium (LD) The structure of 
LD within and among five candidate genes for 77 Fusarium isolates. 
Significant LD (P < 0.05, above diagonal) and LD measured as r 2 
between all pairs of selected SNP loci (below diagonal). The 
horizontal and vertical lines separate the candidate genes, red 
coloring indicates significant LD and the higher the r 2 values (r 2 > 
0.1), the darker is the used color, white indicates non-significant LD 
or i 2 = 0. 



genotypic variance of DON production (Table 2, Figure 
5B). All detected SNPs that were associated to the men- 
tioned phenotypic traits were non-synonymously substi- 
tuted (Table 2). Applying a haplotype analysis across 
genes, we could confirm a significant association 
between mean FHB rating and MetAPI and between 
DON production and TRI1 (Additional filel). 

Discussion 

Association mapping based on candidate genes is a pro- 
mising tool for high-resolution mapping of genes contri- 
buting to quantitative traits [23]. Nevertheless, it has not 
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Figure 5 Significant associations in the candidate genes (A) 

Erf2 and MetApl for mean FHB rating and (B) 77?// for DON content. 
The dashed horizontal line indicates the threshold (Bonferroni- 
corrected P < 0.05). 



yet applied to investigate the basis of quantitative varia- 
tion in aggressiveness and/or DON production in F. gra- 
minearum. This is a totally different approach than 
using knock-out mutants [24], because we are aiming 
for the analysis of quantitative differences produced by 
single nucleotide changes of the respective genes in a 
set of 77 isolates. The high number of singletons may, 
besides sequencing noise, be caused by frequent sexual 
recombination of the fungus. Therefore, we used a strict 
threshold and were analyzing only SNPs with allele fre- 
quencies > 0.1. 



Table 2 SNPs in the candidate genes significantly associated with mean FHB rating or DON content 


Trait/Candidate gene 


SNP # 


Position 3 


Polymorphism 


Changes in amino acids b 


P-value 


Pg (%) c 


Mean FHB rating (%) 














MetAPI 


SNP 05 


904 


A, G 


H, R 


1 .28 X 1 0~ 4 


25.6 




SNP 03 


909 


A, C 


T, P 


1 .94 X 1 0~ 4 


0.5 


Frf2 


SNP 47 


1424 


C,T 


Q, Stop 


6.61 X 10~ 4 


13.1 


DON production (mg kg' 1 ) 














77?, 7 


SNP 23 


851 


A, G 


R, Q 


8.62 X 10~ 3 


4.4 



a The position is relative to the start codon (ATG) 

b Different amino acids coded by SNPs located in exon regions (non-synonymous SNPs) 

c The proportion of explained genotypic variance in percent (p G ) after Bonferroni-Holm procedure 
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Choice of candidate genes 

The genes for deoxynivalenol (DON) biosynthesis reside 
primarily in a 25 kb cluster (TRI5 cluster), TRI1 gene 
belong to a second smaller cluster that has a different 
chromosomal localization [16]. TRI1, TRI5, TRI14 have 
a direct effect on the production of DON or acetylated 
DON, TRI6 and TRI10 are regulating this pathway. 
TRIM export the deoxynivalenol outside the mycelia in 
addition to its major role in aggressiveness [15]. 

The function of MetAPl is still not described in 
Fusarium spp., a deletion, however, reduces growth in 
Saccharomyces cerevisiae [18,25,26]. In both, prokaryotes 
and eukaryotes the N-terminal methionine is often 
cleaved by methionine aminopeptidase encoded by the 
gene MetAPl [18]. Erf2 gene is a component of the RAS 
protein subcellular localization pathway in yeast [17]. 
RAS2 gene is known to affect the pathogenesis of F. gra- 
minearum by regulation of hyphal growth and expres- 
sion of hydrolytic enzymes [27]. Failing of 
palmitoylation mainly controlled by the Erf2 gene in S. 
cerevisiae reduces amount of RAS protein at the plasma 
membrane and lead to poor growth of yeast [17]. Both 
genes are not indispensible for fungal growth because 
their deletion does not prevent any growth at least in S. 
cerevisiae. And the 77 isolates analyzed in this study, all 
had a similar growth in vitro, produced enough spores 
for inoculation and were able to infect the host in the 
field. Significant, quantitative differences for aggressive- 
ness and DON production, however, were found illus- 
trating those genes with regulatory functions might be 
involved. Implementing associations between new candi- 
date genes and important phenotypic traits may provide 
a useful tool to rapidly check a possible contribution of 
these genes to QTL of DON production or aggressive- 
ness in F. graminearum as shown previously in mammal 
research [23]. If an association is found, a more detailed 
analysis of the function of the respective genes should, 
of course, follow. 

Molecular diversity of the 77 isolates 

All isolates were checked by respective primers for their 
designation to F. graminearum s.s. Nucleotide diversity 
within candidate genes of this study ranged from 1 to 9 
per 100 bp being high compared to 0 to 2 SNPs per 100 
bp reported by Cuomo et al. [13]. This was not unex- 
pected since Cuomo et al. [13] used just two isolates 
(PH-1 and GZ3639) while in this study 77 isolates were 
analyzed. Many genes in F. graminearum are localized 
in regions of high SNP density, i.e. highly variable 
regions, especially those expressed during plant infec- 
tion. Moreover, it is known that the density of SNPs is 
biased in F. graminearum in a way that 50% of the 
SNPs are within 13% of the genome sequence, the high- 
est SNP density was reported in some regions of 



chromosome II, where most of the TRI genes are 
located [13]. The high nucleotide diversity of these 
groups of genes suggests that the fungus has a great 
capacity for adaptability and genetic change during its 
interaction with the host plant [13]. Despite this, we 
found much more SNPs with allele frequencies > 0.1 in 
the MetAPl and Erf2 genes than in the TRI genes ana- 
lyzed. This was not the case, when just the number of 
detected SNPs is regarded, indicating that nucleotide 
diversity may be similar, but more rare haplotypes occur 
in the TRI genes. Moreover, high SNPs density was 
detected in one or two large interstitial regions on cho- 
mosomes I, II, and IV [13]. 

Population structure and consequences for association 
mapping 

Correcting for population and/or family structure is 
essential for association mapping to decrease the num- 
ber of false positive QTL [28]. An appropriate statistical 
model should provide an excellent compromise between 
correcting for population stratification to decrease the 
probability of detecting false positive SNP-trait associa- 
tions but still retaining enough information within the 
SNPs for QTL detection [29]. Although some grouping 
according to the sampling location might be seen in the 
PCoA, the violin plots did not show any separated knots 
on the first ten principle coordinates (Figure 2). Thus, 
no distinct subpopulations were identified and correc- 
tion for familial relatedness between isolates should be 
sufficient. Therefore, trait-SNP association was investi- 
gated in detail with the K model, which incorporated 
estimates of kinship coefficients based on SSR data. 

Extent of linkage disequilibrium and resolution of 
association mapping 

Rapid decay of regression parameter r 2 in a short physi- 
cal distance of 200 bp within two genes was presented 
in this study (Figure 3), providing a high resolution of 
association mapping. Generally, low LD was reported in 
F. graminearum populations using selection-neutral 
markers (VNTR, AFLP, or RFLP) especially in the 
region that includes the TRI5 gene cluster [11]. More- 
over, the weak correlation between r 2 and physical dis- 
tance in addition to the large proportion of unlinked 
SNP with significant LD, such as SNPs between gene 
pairs TRISI MetAPl, TRI 10 1 MetAPl, or MetAPl /Erf 2 
(Figure 4), suggest the presence of other forces generat- 
ing LD between unlinked SNPs. The values of r" 
between most pairs of SNPs in TRISITRIlOb are > 0.1, 
presenting these two genes in LD, that might refer to 
the controlling role of TRI10 on TRI5 gene cluster 
[24,30,31]. That TRI 10 and TRI5 are located close to 
each other on the same chromosome play a minor role 
hence the detected LD decay is occurring within 200 bp. 
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Selection acting on oligo- or polygenic traits such as 
aggressiveness and DON production might be responsi- 
ble for this [32]; an alternative explanation would be the 
involvement in the same trait network with other phy- 
siologically important genes. 

Association mapping of genes underlying aggressiveness 
and DON production 

We identified three SNPs related to aggressiveness and 
one SNP related to DON production using the fairly 
conservative Bonferroni-Holm correction. Setting the 
threshold for detection of trait associations to an allele 
frequency of 0.1 might underestimate the number of 
QTL detected but is due to the restricted population 
size. Association between SNPs of candidate gene Erf2 
and phenotypic data revealed a significant association to 
mean FHB rating, but not to DON content. This might 
be explained by the fact that the Erf2 gene is involved in 
RAS2 processing or trafficking that precedes palmitoyla- 
tion of RAS2 genes [17]. RAS2 is known to regulate the 
aggressiveness through affecting fungal growth and reg- 
ulating other pathogenicity genes, e.g., Gmpkl, which 
controls the induction of extracellular enzymes required 
for pathogenesis [27]. Further study is needed to under- 
stand how Erf2 affects differences in aggressiveness of F. 
graminearum isolates. 

Two SNPs in MetAPl gene significantly associated 
with aggressiveness were located adjacent to each other 
forming a collinearity pattern. The role of MetAPl was 
reported in S. cerevisiae as reducing cell growth by N- 
terminal protein modification. A similar role might be 
expected for MetAPl in F. graminearum, hence signifi- 
cant positive correlations between aggressiveness and 
fungal biomass (r = 0.7, P = 0.01), and fungal biomass 
and DON content (r = 0.8, P = 0.01) were reported 
among 50 F. graminearum isolates [33]. Nevertheless, 
analyzed isolates were sampled from visually diseased 
spikelets, thus all were able to infect wheat ears and 
induce symptoms, non-aggressive isolates were not 
included in our F. graminearum population sample. 

A single SNP detected in TRI1 was associated with DON 
production in F. graminearum. TRI1 encodes a cyto- 
chrome P450 monooxygenase that catalyzes hydroxylation 
of C-8 position during trichothecene biosynthesis [34]. 
This confirmation change obviously does not affect 
aggressiveness, because the respective SNP is just asso- 
ciated with DON content but not with aggressiveness. 

Conclusions 

This is the first candidate gene association mapping 
study provided insights on some genes involved in 
aggressiveness and DON concentration of F. grami- 
nearum s.s. The described associations should be vali- 
dated using a larger number of isolates and different 



environments. The validated genes are an important 
starting point for further functional analyses. 

Methods 

Fungal material 

Ears of winter wheat (Triticum aestivum L.) showing 
symptoms of FHB were sampled from three commercial 
fields in Germany to establish a fungal population of F. 
graminearum s.s.: Stuttgart-Hohenheim in southwest 
Germany in 2008, Wetze and Schickelsheim in Lower 
Saxony in 2006 and 2007, respectively. From each 
infected head, one isolate was recovered, transferred 
onto a fresh SNA plate and placed under permanent 
UV light for induction of sporulation as described 
recently [3]. From each isolate, one single spore was 
picked out under the microscope and transferred onto a 
fresh SNA plate to establish a single-spore culture. The 
single-spore isolates were checked morphologically and 
analyzed for their species specificity and chemotype by 
different PCR-based assays as described recently in 
detail [3]. In total, 77 single-spore isolates of F. grami- 
nearum s.s. (former lineage 7) were inoculated on the 
moderately resistant German spring wheat cultivar Tai- 
fun (KWS LOCHOW GMBH, Bergen, Germany) for 
analyzing aggressiveness and DON production in the 
field. 

Design of field studies 

Field experiments were planted at each of two locations 
in 2009 and 2010: Hohenheim (HOH, longitude 9° 12' 
58", latitude 48° 42' 50", altitude 400 m) and Oberer 
Lindenhof (OLI, longitude 9° 18' 12", latitude 48° 28' 
26", altitude 700 m), resulting in four environments 
(location x year combinations). Mean annual tempera- 
tures at HOH and OLI were 10.1°C and 9.0°C, respec- 
tively, mean annual precipitations were 644 mm and 
723 mm, respectively, across 2009 and 2010. Plants were 
grown in two-rowed micro-plots of 1 m length and 0.42 
m width. Plots were arranged in a chessboard-like 
design, i.e., each plot with a wheat entry was bordered 
by four plots of similar size that were planted with a 
long-strawed spring triticale cultivar Nilex (NORDSAAT 
GmbH, Halberstadt, Germany) to reduce inter-plot 
interference caused by drifting of inoculum during 
spraying or secondary distribution of spores. The experi- 
ment was arranged in a split-plot design with two repli- 
cations. The main plot factor was the F. graminearum s. 
s. population, the subplot factor the isolate. Both factors 
were assigned to a randomized complete block design. 
Eight plots per replication included in the subplot factor 
were not inoculated to estimate the degree of natural 
infection. 

Inoculum was sprayed with a concentration of 2 x 10 
spores ml" onto the wheat heads of each plot. 
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Inoculation was performed at full flowering time of the 
wheat cultivar to ensure maximum susceptibility of 
wheat to F. graminearum. Fusarium head blight (FHB) 
aggressiveness was rated visually four times as the per- 
centage of infected spikelets per plot (0-100%). This rat- 
ing included both the number of infected spikes per 
plot and the number of infected spikelets per spike. To 
compare FHB reactions, the arithmetic means of four 
ratings were used and assigned as mean FHB rating 
throughout the paper. All plots were harvested by hand, 
threshed, and the grain analyzed to quantify the amount 
of DON by a commercially available immunotest (R-bio- 
pharm AG, Darmstadt, Germany) as previously 
described in detail [9]. This test cannot differentiate 
between DON and 3-ADON, so the results include both 
mycotoxins. Generally, however, the amount of 3- 
ADON is only 2-3% of total DON content [35]. In each 
of the four environments, the natural infection rate was 
very low ranging from 0 to 3% FHB rating and from 0 
to 0.34 mg kg' DON concentration. Prediction of NIV 
chemotype was done using the primers Nil, 15D11, 
3D11, and 11R in a multiplex polymerase chain reaction 
(PCR) as designed by Zhang et al. [36]. 

Detection of population structure and gene sequencing 

In order to analyze the population structure, all isolates 
were fingerprinted with 19 simple sequence repeat (SSR) 
markers dispersed throughout the whole genome follow- 
ing standard protocols [3]. 

Five candidate genes were chosen according to their 
role in trichothecene biosynthesis and/or aggressiveness 
(TRI1, TRI5, TRI6, TRI10, and TRIM). Two other, yet 
uncharacterized candidate genes in Fusarium spp. (Erf2, 
MetAPl) were tested for a possible association with 
aggressiveness. Nucleotide sequences of these genes 
were imported from Fusarium graminearum database 
FGDB [37]. The selected genes were (Table 3): TRI1 
(FGSG_00071), TRI5 (FGSG_03538), TRI6 
(FGSG_16251), TRI10 (FGSG_03538), TRI14 
(FGSG_03543), MetAPl (FGSG_01397), and Erf2 
(FGSG_08531). Specific primers were designed (see 
Additional file 2) to amplify parts of these genes using 
the software Primer Premier 4.0 (Premier Biosoft Inter- 
national, CA, U.S.A.). 

Polymerase chain reaction (PCR) was performed using 
the designed primers related to each gene separately fol- 
lowing standard protocols, however, with a different 
annealing temperature for each gene (Table 3). Purifica- 
tion of PCR products was performed by precipitating 
the DNA with 10% (v/v) of 3 M Sodium Acetate and 
25% (v/v) of absolute cold ethanol overnight. Precipi- 
tated DNA was cleaned twice with 70% ethanol and 
finally diluted in 10 ul ddH 2 0. Purified PCR products 
were sequenced (QIAGEN® Sequencing Services, 



Hilden, Germany). The sequence was performed once 
for each isolate, hence the alignment was mostly identi- 
cal (the sequence was repeated in case it fail to align to 
the other sequence or in case of high noise in the 
sequence). Expected sizes of PCR products were 
obtained for all tested genes. All sequences of TRI6 and 
TRI14 were located in the coding region of the genes 
(exon, see Additional file 1). Sequences of the genes 
TRI5, MetAPl, Erf2, and TRI10 (two parts) were 
stretched over two exons. Sequences of genes were 
obtained from right to left sides of PCR products, just 
genes MetAPl and Erf2 were obtained from left to right 
side. Sequence of TRI1 was located over four exons 
according to the reference sequence. Number of identi- 
fied SNPs was high over all sequenced regions of candi- 
date genes (Table 1). The sequences were aligned 
(according to the initial matrix published in FGDB) 
using CLC sequence viewer 6.3 (CLC-bio, Aarhus, Den- 
mark) to identify single nucleotide polymorphisms 
(SNPs) among the 77 isolates. 

Phenotypic data analyses 

The following linear mixed model was used to estimate 
variance components: j/ y „ = fi+IsOi+Envj+(Iso x Env)^ 
+Rep n +eij n , where [i is the population mean, /so, the 
genetic effect of the ith isolate, Envj the effect of the y'th 
environment, Iso x Env the isolate times environment 
interaction effect, Rep n the effect of the «th replication, 
and eijn the residual error. Variance components were 
determined by the restricted maximum likelihood 
(REML) method using the software ASReml 2.0 (VSN 
International Ltd, Hemel Hempstead, U.K.). Significance 
for variance component estimates was tested by model 
comparison with likelihood ratio tests where the halfed 
P values were used as an approximation [38]. Heritabil- 
ity (h 2 ) on an entry-mean basis was estimated as the 
ratio of genotypic to phenotypic variance according to 
Melchinger et al. [39]. Furthermore, genotypes were 
regarded as fixed effects and best linear unbiased esti- 
mates (BLUEs) were determined for all isolates and 
traits. 

Genetic relatedness among the 77 isolates was deter- 
mined by applying principal coordinate analysis (PCoA) 
[40] based on the modified Rogers' distances of the iso- 
lates [41]. Linkage disequilibrium (LD) between the 
selected SNPs was assessed by the LD measure r 2 [42] 
and significance of LD was tested with Fisher's exact 
tests [43]. LD and PCoA computations were performed 
with the software package Plabsoft [44]. 

Association analysis 

A two-step association approach was applied in this 
study and the BLUEs per environment were used as 
input for the association analysis. The linear mixed 
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Table 3 Name of tested genes, chromosomal localization, primer sequence and expected amplified DNA products 



No. 


Gene ID a 


Chromosome 




Sequences of the primers 


Ta (°C) 


Expected products (bp) 


Gene size (bp) 


1 


77?// (FGSG 00071) 


| 


F 


CACCAGTTTGCAGGATGT 


53.2 


850 


1980 








R 


AATGGGAGTGATTAGTTCG 








2 


TRI5 (FGSG 03537) 




F 


TGGCGGATCTATCTATFCA 


57.1 


750 


1 1 29 








R 


CTFCTTGGCGTCGTCTGT 








3 


TRI6 (FGSG_1625) 




F 


ATGGAGGCCGAATCTCAC 


54.4 


600 


673 








R 


CCACCCTGCTAAAGAGCCT 








4 


TRI10 a (FGSG 03538) 




F 


ATTTCCGAAAGCCTAGACAA 


54.2 


700 


1264 








R 


c~c~c~t~i — ta aty — rrr a a ATrf — r 
bbLLb I AA I L I I LAAA I bb I 








5 


TRI10 b (FGSG_03538) 




F 


GGAGCATTTGAAGATTACGG 


54.2 


550 










R 


CGTCAAGTCTTCCCATGTGA 








6 


TRIM (FGSGJB543) 




F 


AGCCGGAACGCGCTACAT 


57.3 


750 


1116 








R 


CGAACCTGCTGCTCTFAC 








7 


Met API (FGSG_01397) 


I 


F 


AACGATGCCGGATCGTCA 


55.3 


800 


1403 








R 


TFGGCAAACTCTCGGATGAA 








8 


Erf2 (FGSG_08531) 


IV 


F 


AGGCATCmGTFGTTGTG 


52.4 


750 


2040 








R 


rFGGTAATAGGTGGGTFGT 









5 The given gene ID is the entry number of the MIPS F. graminearum genome database (FGDB; Wong et al. 2011) 



model for the association approach was: y ijn = fi+a p +IsOi 
+EnVj+eij p , where a p is the effect of allele p. The allele 
effect a p was modeled as fixed effect whereas /so, and 
EnVj were regarded as random effects. We assumed that 
the variance of the random genetic effect was Var(g) = 
2Ka 2 g , where a 2 g refers to the genetic variance estimated 
by REML and K was a 77 x 77 matrix of kinship coeffi- 
cients that define the degree of genetic covariance 
between all pairs of entries. We followed the suggestion 
of Bernardo [45] and calculated the kinship coefficient 
K« between isolates i and / on the basis of the SSR mar- 
ker data as Ku = l+(5»-l)/(l-7^), where S i; is the pro- 
portion of marker loci with shared variants between 
isolates i and /, and T ; y is the average probability that a 
variant from one isolate i and a variant from one isolate 
/ are alike in state, given that they are not identical by 
descent. The coefficient T,y was estimated separately for 
each gene and trait using a REML method setting nega- 
tive kinship values between isolates to zero. SNPs with 
allele frequencies < 0.1 were not considered in the asso- 
ciation analysis or LD estimation. The obtained opti- 
mum T values were for DON content: TRI1 (0.025), 
TRI5 (0.575), TRI10 (0.375), Met API (0.300), and Erf2 
(0.400). For mean FHB rating: TRI1 (0.600), TRI5 
(0.150), TRI10 (0.200), MetAPl (0.075), and Erf2 (0.275). 
For the detection of main effects of the candidate gene 
SNPs, these were fitted as fixed effects in the mixed 
model and their significance was tested by a Wald F 
test. Based on the Wald F statistic, we performed tests 
for the presence of marker-phenotype associations with 
a significant (P < 0.05) effect on DON content and 
mean FHB rating applying the Bonferroni-Holm proce- 
dure [22] to correct for multiple testing. 



The proportion of genotypic variance (p G ) explained 
by the detected SNP was calculated by fitting each SNP 
in a linear model to obtain R 2 a dj- The ratio po = R 2 a dj/ 
h 2 yielded the proportion of genotypic variance [46]. In 
the case of MetAPl, where two SNPs were detected for 
mean FHB rating, both were simultaneously fitted in the 
linear model in the order of their P values to correct for 
collinearity. Haplotype analyses were performed with the 
same procedure to test the association with both pheno- 
typic traits (Additional file 2: Table SI). We applied the 
same threshold level like for SNP analyses. All mixed 
model calculations were performed using the software 
ASReml 2.0 (VSN International Ltd, Hemel Hempstead, 
U.K.). 

Additional material 



Additional file 1: Genomic organizations of the seven candidate 
genes with exons in black color. The sequenced region is shown as a 
green bar above each gene. 

Additional file 2: Table SI. Haplotype analysis of the genes MetAPl and 
TRIP 
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